Indicators Ratings Parallel =  7 ms
Indicators Ratings =  2 ms
CPU compute ratings: 1 iteration avg time: 548 ms
 
 ---- 
cuda_malloc<d_ratings> 0.61
cuda_malloc<d_companies_map> 0.58
cuda_malloc<d_indicatorsRatings> 1.51
kernel<computeRatingsKernel> 0.89
kernelSegment<sort> 2.33
toHostMemory<d_ratings> 0.56
toHostMemory<d_companies_map> 0.58
toHostMemory<d_indicatorsRatings> 4.39
ranking 0.52
TotalRankingTime 12.3

cuda_malloc<d_ratings> 0.08
cuda_malloc<d_companies_map> 0.09
cuda_malloc<d_indicatorsRatings> 0.08
kernel<computeRatingsKernel> 0.93
kernelSegment<sort> 0.72
toHostMemory<d_ratings> 0.57
toHostMemory<d_companies_map> 0.56
toHostMemory<d_indicatorsRatings> 8.32
ranking 0.51
TotalRankingTime 12.16

cuda_malloc<d_ratings> 0.06
cuda_malloc<d_companies_map> 0.12
cuda_malloc<d_indicatorsRatings> 0.07
kernel<computeRatingsKernel> 0.87
kernelSegment<sort> 0.75
toHostMemory<d_ratings> 0.56
toHostMemory<d_companies_map> 0.57
toHostMemory<d_indicatorsRatings> 8.05
ranking 0.53
TotalRankingTime 11.88

GRID K520 :  797.000 Mhz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3792MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 Mhz x 256 bits   (160.0 GB/s)
ECC Disabled


